PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa03g006610.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 717aa    MW: 78770.8 Da    PI: 5.6223
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa03g006610.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox61.51.3e-1965121157
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57 
                     +++ +++t+ q++e+e++F+++++p+ ++r++L+++lgL+  qVk+WFqN+R+++k+
  Csa03g006610.1  65 KKRYHRHTQLQIQEMEAFFKECPHPDDTQRKQLSRELGLEPLQVKFWFQNKRTQMKN 121
                     688999************************************************995 PP

2START218.62.1e-682524652206
                     HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECT CS
           START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetleviss 87 
                     la +a++el+++a++++ +W++++   + +e+ ++f+++ +     +++ea+r+++vv+m++++ ve+l+d++ qW++ +a    +a+tl+v+s+
  Csa03g006610.1 252 LAVAAMEELMRMAQVDDSLWKSLV--FDDEEYARTFPRGIGprpagFRSEASRETAVVIMNHVNIVEILMDVN-QWSTIFAgmvsRAMTLAVLST 343
                     6789********************..************999********************************.********************* PP

                     T......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SS CS
           START  88 g......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgr 175
                     g      galq+m+ae+q++splvp R+ +f+Ry++q+g+g+w++vd+S+ds q++p     +R++++ Sg+li++++ng+skvtwvehv++++r
  Csa03g006610.1 344 GvagnfnGALQVMTAEFQVPSPLVPtRETYFARYCKQQGDGSWAVVDISLDSLQPNPP----ARCRRRASGCLIQEMPNGYSKVTWVEHVEVDDR 434
                     *********************************************************8....********************************* PP

                     XXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 176 lphwllrslvksglaegaktwvatlqrqcek 206
                      +h+l++++v++g+a+gak+wva l+rqce+
  Csa03g006610.1 435 GVHSLYKHMVSTGHAFGAKRWVAILDRQCER 465
                     *****************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.8E-2143120IPR009057Homeodomain-like
SuperFamilySSF466895.43E-1953122IPR009057Homeodomain-like
PROSITE profilePS5007116.29362122IPR001356Homeobox domain
SMARTSM003893.7E-1863126IPR001356Homeobox domain
CDDcd000861.67E-1865123No hitNo description
PfamPF000463.3E-1765120IPR001356Homeobox domain
PROSITE patternPS00027097120IPR017970Homeobox, conserved site
PROSITE profilePS5084841.829242468IPR002913START domain
SuperFamilySSF559611.28E-33242467No hitNo description
CDDcd088751.62E-121246464No hitNo description
SMARTSM002342.6E-58251465IPR002913START domain
PfamPF018522.5E-60252465IPR002913START domain
Gene3DG3DSA:3.30.530.203.0E-5341432IPR023393START-like domain
SuperFamilySSF559617.19E-26484708No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0010090Biological Processtrichome morphogenesis
GO:0048497Biological Processmaintenance of floral organ identity
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 717 aa     Download sequence    Send to blast
MFEPNMLLAA MNNADSNNHN YNHEDNNNEG FLRDDEFDSA NTKSGSENQE GGSGNDQDPL  60
HPNKKKRYHR HTQLQIQEME AFFKECPHPD DTQRKQLSRE LGLEPLQVKF WFQNKRTQMK  120
NHHERHENSH LRAENEKLRN DNLRYREALA NASCPNCGGP TAIGEMSFDE HQLRLENARL  180
REEIDRISAI AAKYVGKPVS NYPLMSPPPL PPRPLELGMG NLGGEAYGNN PTDLLKSITT  240
PTEADKPVII DLAVAAMEEL MRMAQVDDSL WKSLVFDDEE YARTFPRGIG PRPAGFRSEA  300
SRETAVVIMN HVNIVEILMD VNQWSTIFAG MVSRAMTLAV LSTGVAGNFN GALQVMTAEF  360
QVPSPLVPTR ETYFARYCKQ QGDGSWAVVD ISLDSLQPNP PARCRRRASG CLIQEMPNGY  420
SKVTWVEHVE VDDRGVHSLY KHMVSTGHAF GAKRWVAILD RQCERLASVM ATNISSGEVG  480
VITNQEGRRS MLKLAERMVI SFCAGVSAST AHTWTTLSGT GAEDVRVMTR KSVDDPGRPP  540
GIVLSAATSF WIPVPPKRVF DFLRDENSRN EWDILSNGGV VQEMAHIANG RDTGNCVSLL  600
RSANSSQSNM LILQESCTDP TASFVIYAPV DIVAMNIVLN GGDPDYVALL PSGFAILPDG  660
NANGGGDGGS LLTVAFQILV DSVPTAKLSL GSVATVNNLI ACTVERIKAS MSCETA*
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAK3167760.0AK316776.1 Arabidopsis thaliana AT1G05230 mRNA, complete cds, clone: RAFL09-78-H10.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010484829.10.0PREDICTED: homeobox-leucine zipper protein HDG2 isoform X2
SwissprotQ94C370.0HDG2_ARATH; Homeobox-leucine zipper protein HDG2
TrEMBLB3H6Y40.0B3H6Y4_ARATH; Homeobox-leucine zipper protein HDG2
STRINGAT1G05230.10.0(Arabidopsis thaliana)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM49128149
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G05230.30.0homeodomain GLABROUS 2